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TO ALL WHOM IT MAY CONCERN: 

Be it known that Anthony P. Shuber, a citizen of the United States of America, residing at 
1 1 Grant Street, Milford, Massachusetts, in the United States of America, has invented new and 
useful improvements in: 

METHODS FOR DETECTING CONTAMINATION 
IN MOLECULAR DIAGNOSTICS USING PCR 

t 

of which the following is a specification. 
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METHODS FOR DETECTING CONTAMINATION IN 
MOLECULAR DIAGNOSTICS USING PCR 

This application claims the benefit of the filing date of provisional application Serial 
No. 60/063,219, filed October 23, 1997. 

Background of the Invention 

The polymerase chain reaction (PCR) is a widely-used clinical laboratory procedure 
for sequence-specific target amplification. However, contamination is an ongoing problem. 
For many PCR applications, it is essential that the only DNA that enters the reaction is the 

template to be amplified. 

Increases in the sensitivity and specificity of PCR have enabled analysis of 
heterogeneous DNA (e.g., from tumor biopsies, stool). The DNA to be amplified is typically 
a rare event in the context of a heterogeneous sample. However, as the degree of sample 
heterogeneity increases, the tolerable threshold of background (signal generated from a 
negative control sample) becomes increasingly lower. This is necessary to retain a sufficient 
signal to noise ratio between positive clinical samples and negative control samples within an 
assay, and therefore to retain high confidence in the assay results. The end result of applying 
PCR to more heterogeneous DNA environments is a reduced tolerance for pre-PCR 
contamination from previous amplified material. Currently, there are three methods applied 
to prevent PCR contamination: (1) physical separation of the sample, pre-PCR setup, and 
post-PCR manipulations; (2) use of Uracil DNA-glycosylase and dUTP instead of dTTP, and 
(3) the use of UV irradiation. 

Thousands of samples may be analyzed in a single clinical assay with multiple PCR 
negative controls added. In this context, an investigator relies on the presence or absence of 
amplified product within a limited number of negative control samples to confirm the origin 
of amplification products observed in experimental samples. If only one PCR negative 
control sample is positive, the entire assay is invalid, and must be repeated. In an assay 
containing 1000 samples, each sample must be run with another set of negative controls when 
contamination is observed. 



However, the mere lack of amplification product within the PCR negative control is 
not determinative of a positive PCR result in a sample in which contamination is rare. This 
kind of sporadic contamination is especially problematic in an extremely large throughput 
assay in which 5 to 10 negative controls are run for approximately every 1000 samples. 
Statistically, the likelihood of sporadic contamination in, for example, 1000 samples will not 
be detected in only 5 negative controls. Sporadic contamination is also a significant problem 
when PCR based analyses are performed on heterogeneous (rare event analysi) samples in 
which a positive result is generated from, for example, 1-5% of the total amplification product 
present within the sample. Generally, within a PCR based inherited disease diagnostic assay, 
given the 50% heterogeneity that exits in any genomic DNA sample, a 1-5% increase in signal 
in a true negative sample would appear as a slight increase in background, but would not 
indicate a false positive result. However, within an assay involving samples with 
heterogeneous populations of DNA, a 1-5% positive signal generated by a true negative 
sample would result in a false positive. 

In addition, even within an inherited disease diagnostic assay, if there were 1000 
samples analyzed and 5-10 negative control PCR reactions were run in parallel, and one or 
two of the negative control samples were positive, results from any of the samples themselves 
would be compromised. If the contamination of the PCR negative control samples is truly 
sporadic, then repeat analysis of all 1000 samples is probably not necessary and extremely 
costly. The lack of amplification product within the PCR negative control samples is not 
determinative that a positive PCR result within an experimental sample set is not from rare 
(sporadic) contamination that has occurred in only a few samples within the assay (and not 
due to the negative controls run in parallel). 

In many assays, "normal" PCR contaminants (e.g., resulting from purification 
problems) are an even greater hindrance and leads to decreased sensitivity of the assay. These 
"normal" PCR contaminants can lead to false negative results that undermine the accuracy of 
(and confidence in) the particular assay. 

Therefore, methods are needed for performing clinical analyses on samples of DNA 
heterogeneity (e.g. sporadic cancer detection) such that sporadic contamination from previous 



amplification product or "normal" PCR contaminants do not result in false positive or false 
negative results. 

Summary of the Invention 

The invention provides methods for determining whether contamination from previous 
5 amplification product exists in products of a polymerase chain reaction (PCR). Specifically, 
the invention relates to methods for detecting the presence of PCR products (amplicons) that 
would not be present but for contamination from previous amplification product in the PCR 
sample. Methods of the invention are useful for detection of contamination in any PCR. 
Furthermore, the methods of the invention are useful to avoid false negative and false positive 

10 results and the decreased assay sensitivity associated with PCR contamination. However, 
such methods are especially useful in heterogeneous samples, particularly samples in which 
the detection of a rare event (i.e. a small subpopulation of a nucleic acid in a heterogeneous 
sample) is the ultimate object of the PCR. 

In a preferred embodiment, methods of the invention comprise the utilization of 

15 optimal primer construction for PCR. Accordingly, in a highly-preferred embodiment, 
methods of the invention comprise conducting a first amplification using one or more 
(preferably two) chimeric primers. A chimeric primer, for purposes of the invention, is one 
comprising a primer having substantial sequence specificity with the template to be amplified 
(a template-specific sequence) and a 5' end that is referred to herein as a "contamination 

20 detection sequence" (CDS). Methods further comprise conducting a second, parallel, 

amplification reaction using at least one (preferably two) contamination detection sequence 
(without the attached template-specific sequence, or with only a minimal number of template- 
specific bases, as described below) as a primer. Finally, detection of an amplicon in the 
second amplification reaction means that the sample is contaminated with previous 

25 amplification product, because no such amplicon would be generated absent contamination. 

A contamination detection sequence may be any sequence (regardless of length) that 
does not have substantial sequence specificity (i.e., does not hybridize under stringent 
conditions) with the template. See Figure 2 A. Accordingly, methods of the invention 
comprise conducting two amplification reactions on each sample suspected to contain a 

30 template sequence, the amplification of which is desired. The first reaction utilizes the 
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chimeric primers described above, and yields the desired amplicon (which may then be 
sequenced, probed, etc.). The second amplification reaction utilizes only the contamination 
detection sequence as primers (which are non-specific relative to the template). Any amplicon 
produced in the second reaction is evidence of contamination with previous amplification 
5 product. 

In a preferred embodiment, the amplification reaction is selected from PCR, reverse 
transcriptase PCR, and Q-PCR. Also in a preferred embodiment, the sample containing 
nucleic acid to be amplified is a stool sample. A stool sample contains a highly- 
heterogeneous population of nucleic acids. Human nucleic acids represent a small portion of 

10 the nucleic acid present in stool. More specifically, a stool sample may contain molecular 

indicia of cancer, specifically colorectal cancer, that occurs as a small subpopulation (typically 
on the order of about 1% at early stages of cancer or precancer) of the total nucleic acid in the 
stool. Sensitive assays (which may or may not involve amplification) have been developed to 
detect such small subpopulations. See, e.g., U.S. Patent No. 5,670,325, incorporated by 

15 reference herein. Amplification of a nucleic acid containing a mutation indicative of cancer or 
precancer may be confounded by PCR contaminants in the sample, especially if the detection 
limits of the assay are near or above the percent contaminants in the sample. The present 
invention detects PCR contaminants, thus allowing a given PCR reaction to be excluded from 
analysis on a sample-by-sample basis. Thus, if four separate samples are taken for 

20 amplification, each sample is divided into two subsample aliquots, one of which is amplified 
using chimeric primers, and in the other, the contamination detection sequence primers are 
used to check for contamination in the aliquot. Therefore, each aliquot of sample for which 
amplification is sought has its own quality control assay. 

These and other advantages and aspects of the invention will be understood upon 

25 consideration of the following detailed description thereof. 

Brief Description of the Drawing s 

Figure 1 shows a schematic diagram of the chimeric primers used in the present 
invention. 
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Figure 2A shows a schematic representation of PCR amplification using chimeric 
primers. 

Figure 2B shows a schematic representation of PCR amplification using CDS-specific 
primers on uncontaminated sample (resulting in no amplified product). 
5 Figure 2C shows a schematic representation of PCR amplification using CDS-specific 

primers on contaminated sample (resulting in amplified product). 

Figure 3 shows a schematic diagram of an assay performed using the methods of the 
invention. 

Detailed Description of the Invention 

10 Methods of the invention comprise optimal PCR primer design. Normally, 

target-specific PCR primers are complementary to sequences present within the target. The 
target sequence is part of, and endogenous to, the target DNA analyte (the analyte can be any 
target DNA of interest: human DNA, viral DNA etc.) and is therefore one that is expected to 
be present in all of the experimental samples (in the case of inherited disease diagnostics), or 

15 at least in all of the positive samples (in the case of infectious disease diagnostics). 

Methods of the invention comprise PCR primers that have a non-homologous or 
non-complementary "contamination detection sequence" ("CDS") attached to the 5' end of 
target-specific PCR primers. (See Fig. 1). The CDS region is neither homologous to, nor 
complementary to, any endogenase (template) sequence. Therefore, following any PCR 

20 involving chimeric primers, the CDS becomes incorporated into the PCR products 

(amplicons) generated from the PCR. (See Fig. 2A). Therefore, only PCR products from 
previous reactions have the CDS region contained within them. 

In the present invention, a sample to be assayed for a particular analyte (which may be 
one of hundreds or thousands in a single clinical assay) is analyzed by two distinct, parallel 

25 amplification reactions. In a first reaction, PCR is performed on the sample using chimeric 
sequences that contain a template-specific sequence (a sequence substantially complementary 
to a specific DNA analyte) and a 5' CDS sequence (the CDS sequence is contiguous to the 5' 
end of the template specific sequence). In a second reaction, PCR is performed on the sample 
using primers that are specific for previously amplified amplicons containing the CDS 
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sequence. The CDS primers (1) may be sequences that are specific for the CDS sequence 
alone (i.e., with no cross-reactivity to the target analyte sequence), or (2) the primers can 
comprise the CDS sequence with additional bases attached to the 3' end. From one to nine 
bases may be added at the 3' end of the CDS sequence and may serve to provide additional 
5 specificity. The CDS primers will not effectively prime the target analyte sequence. 

The presence of amplified product (or amplicon) as a result of the PCR using the 
chimeric primer indicates a positive result for the presence of the particular analyte, but may 
also reflect contamination from previous PCR product. 

The PCR with CDS-specific primers acts as negative control. Because the only 
10 samples that contain the CDS sequence will be those generated by previous PCR events 
within the lab, the presence of amplified product after PCR with the CDS-specific primers 
indicates that that particular sample is contaminated, and the results should be discarded. The 
lack of amplified product reflects the absence of PCR-based contaminants in a particular 
sample as portion of a sample. 
15 Accordingly, the present invention eliminates the degree of repeat sample analyses 

performed within, for example, high throughput assays by specifically identifying only 
samples that have contamination. 

The invention also provides additional protection needed within assays of 
heterogeneous samples, where sporadic contamination is more likely to be the source of 
20 contamination. It enables identification of the specific samples within the essays that are truly 
contaminated. The invention also provides a sample specific internal control for determining 
PCR product contamination. 

The present invention is suitable for use with a variety of experimental samples that 
may contain a particular DNA analyte. Biological samples may be used in the present 
25 invention, including blood and stool samples. 

The methods of the present invention are especially suitable for applications such as 
inherited disease diagnostics and related kits; infectious disease diagnostics and related kits; 
clinical assays involving sporadic cancer detection (e.g. testing DNA from stool for colorectal 
cancer) and related kits; and other w rare event" clinical assay and related kits. 
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Example 1 

Stool sample is collected and prepared as described in U.S. Patent No. 5,741,650 and 
in copending applications Serial No. 08/876,638, filed June 16, 1997, both incorporated by 
reference herein. Specifically, stool is collected and prepared so that a sample contains at 
5 least a cross-sectional portion of a stool voided by a patient. Alternatively, whole stool may 
be used. The sample is homogenized in a physiologically compatible buffer (e.g., having a 
final concentration: 500 mM Tris, 16 mM EDTA and 10 mM NaCl, pH 9.0), using an Exactor 
II shaker for 15 minutes. A 20% SDS solution is added to a final concentration of 0.5%, 
Proteinase K is also added to a final concentration of 500 jag/ml and incubated at 37°C. 

10 For exemplification, sequence-specific primers suitable for PCR are chosen to . 

correspond to a portion of the Jcras gene sequence. These are: Primer 1 (SEQ. ID. NO. 2): 
5 ' -GATTCCTACA GGAAGC AAGT AGT AATTG-3 ' , and Primer 2 (SEQ. ID. NO. 3): 
5 ' -TAATGGTGAAT ATCTTC AAATG ATTTAG-3 ' . 

The contamination detection sequence (CDS) is 5 5 - 

15 GCGGTCCC AAAAGGGTC AGT-3 ' (SEQ. ID. NO. 1). The chimeric primers contain the 
20-nucleotide CDS sequence attached (i.e. contiguous) to the 5' end of the individual 
sequence-specific primers (primer 1 or primer 2). Oligonucleotides are HPLC purified and 
quantitated by spectrophotometry. 

PCR amplifications are performed using from about 4 jal (1-2 |ig) to about 10 |il (5-50 

20 ng) of genomic DNA prepared from stool samples. PCR amplifications are done using a 

Perkin-Elmer 9600 Thermal Cycler (Perkin-Elmer, Norwalk, CT) for 28 cycles with ramping 
(94°C/10-sec hold with 48-sec ramp, 60°C/10-sec hold with 36-sec ramp, 72°C/10-sec hold 
with 38 sec ramp). Reactions (50 jjlI) are carried out in 1 x PCR buffer (10 mM Tris-HCl at 
pH 8.3, 50 mM Kcl , 1 .5 mM MgCL 2 ), 200 |im dNTPs, 2.5 units, of Tag polymerase (Perkin- 

25 Elmer, Norwalk, CT). 

For PCR product analyses, 8 jul of the amplification reactions is loaded directly onto a 
2% ethidium bromide stained agarose gel and electrophoresed at 250 V for 90 min. The 
amplification products are visualized with a UV transilluminator (Fotodyne, New Berlin, WI) 
and photographed with an Alpha Innotech IS-500 Digital Imaging System version 1,97 (Sun 

30 Bioscience Inc., Branford, CT). 
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A first PCR is performed in a first aliquot of stool sample (containing kras) using 
chimeric primers, wherein the forward primer comprises primer 1 with the CDS contiguous 
with its 5' end; and the reverse primer comprises primer 2 with the CDS contiguous with its 
5' end. The first PCR results in an amplicon comprising both chimeric primers and the 
5 intervening template sequence. 

A second PCR is performed on a second aliquot of stool sample in which both the 
forward and reverse primers are the CDS. If contamination from previous PCR cycles is 
present in the sample, the second PCR will product an amplicon. If no contamination is 
present in the sample, no amplicon is observed in the second aliquot. 
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